NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Safety on the Fly: Constructing Robust Safety Filters via Policy Control Barrier Functions at Runtime

https://doi.org/10.1109/LRA.2025.3597847

Knoedler, Luzia; So, Oswin; Yin, Ji; Black, Mitchell; Serlin, Zachary; Tsiotras, Panagiotis; Alonso-Mora, Javier; Fan, Chuchu (October 2025, IEEE Robotics and Automation Letters)

Free, publicly-accessible full text available October 1, 2026
Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference

Ji, D; Smyth, P; Steyvers, M (December 2020, Advances in Neural Information Processing Systems)

Group fairness is measured via parity of quantitative metrics across different protected demographic groups. In this paper, we investigate the problem of reliably assessing group fairness metrics when labeled examples are few but unlabeled examples are plentiful. We propose a general Bayesian framework that can augment labeled data with unlabeled data to produce more accurate and lower-variance estimates compared to methods based on labeled data alone. Our approach estimates calibrated scores (for unlabeled examples) of each group using a hierarchical latent variable model conditioned on labeled examples. This in turn allows for inference of posterior distributions for an array of group fairness metrics with a notion of uncertainty. We demonstrate that our approach leads to significant and consistent reductions in estimation error across multiple well-known fairness datasets, sensitive attributes, and predictive models. The results clearly show the benefits of using both unlabeled data and Bayesian inference in assessing whether a prediction model is fair or not.
more » « less
Full Text Available
Active Bayesian Assessment of Black-Box Classifiers

Ji, D; Logan IV, R; Smyth, P; Steyvers, M (February 2021, Proceedings of the AAAI Conference on Artificial Intelligence)

Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an active Bayesian approach for assessment of classifier performance to satisfy the desiderata of both reliability and label-efficiency. We begin by developing inference strategies to quantify uncertainty for common assessment metrics such as accuracy, misclassification cost, and calibration error. We then propose a general framework for active Bayesian assessment using inferred uncertainty to guide efficient selection of instances for labeling, enabling better performance assessment with fewer labels. We demonstrate significant gains from our proposed active Bayesian approach via a series of systematic empirical experiments assessing the performance of modern neural classifiers (e.g., ResNet and BERT) on several standard image and text classification datasets.
more » « less
Full Text Available

Search for: All records